Incorporating Global Information into Secondary Structure Prediction with Hidden Markov Models of Protein Folds

نویسندگان

  • Valentina Di Francesco
  • Philip McQueen
  • Jean Garnier
  • Peter J. Munson
چکیده

Here we propose an approach to include global structural information in the secondary structure prediction procedure based on hidden Markov models (HMMs) of protein folds. We first identify the correct fold or 'topology' of a protein by means of the HMMs of topology families of proteins. Then the most likely structural model for that protein is used to modify the sequence of secondary structure states previously obtained with a prediction algorithm. Our goal is to investigate the effect on the prediction accuracy of including global structural information in the secondary structure prediction scheme, by means of the HMMs. We find that when the HMM of the predicted topology of a protein is used to adjust the secondary structure sequence predicted originally with the Quadratic-Logistic method, the cross-validated prediction accuracy (Q3) improves by 3%. The topology is correctly predicted in 68% of the cases. We conclude that this HMM based approach is a promising tool for effectively incorporating global structural information in the secondary structure prediction scheme.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fold recognition using predicted secondary structure sequences and hidden Markov models of protein folds.

We present an analysis of the blind predictions submitted to the fold recognition category for the second meeting on the Critical Assessment of techniques for protein Structure Prediction. Our method achieves fold recognition from predicted secondary structure sequences using hidden Markov models (HMMs) of protein folds. HMMs are trained only with experimentally derived secondary structure sequ...

متن کامل

Hidden Markov Model for protein secondary structure

We address the problem of protein secondary structure prediction with Hidden Markov Models. A 21-state model is built using biological knowledge and statistical analysis of sequence motifs in regular secondary structures. Sequence family information is integrated via the combination of independent predictions of homologous sequences and a weighting scheme. Prediction accuracy with single sequen...

متن کامل

Protein secondary structure prediction based on quintuplets

Simple hidden Markov models are proposed for predicting secondary structure of a protein from its amino acid sequence. Since the length of protein conformation segments varies in a narrow range, we ignore the duration effect of length distribution, and focus on inclusion of short range correlations of residues and of conformation states in the models. Conformation-independent and -dependent ami...

متن کامل

The Application Of Hidden Markov Models to Protein Secondary Structure Prediction

The functional properties of proteins depend upon their 3D structures, therefore, it is advantageous to deduce the 3D structure of a protein from its amino acid sequence. This is a difficult task because there are 20 different amino acids that can be combined into “many more different proteins than there are atoms in the known universe” [2]. De novo prediction methods often involve a first step...

متن کامل

An HMM posterior decoder for sequence feature prediction that includes homology information

MOTIVATION When predicting sequence features like transmembrane topology, signal peptides, coil-coil structures, protein secondary structure or genes, extra support can be gained from homologs. RESULTS We present here a general hidden Markov model (HMM) decoding algorithm that combines probabilities for sequence features of homologs by considering the average of the posterior label probabilit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings. International Conference on Intelligent Systems for Molecular Biology

دوره 5  شماره 

صفحات  -

تاریخ انتشار 1997